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MongoDB Fundamentals 


Learning Outcomes 


e Differentiate between SQL and NoSQL databases. 
e Describe the structure that MongoDB uses in storing data. 
e Create queries against a MongoDB instance to store, manipulate and retrieve data. 


What is MongoDB? 


MongoDB is an open-source document-oriented NoSQL database used for high-volume data 
storage. 


The term ‘NoSQL’ means ‘non-relational’. It means that MongoDB isn’t based on the table-like 
relational database structure but provides an altogether different mechanism for the storage and 
retrieval of data. This format of storage is called BSON ( similar to JSON format). 


NoSQL is used as an alternative to traditional relational databases. NoSQL databases are quite 
useful for working with large sets of distributed data. 


NoSQL databases can be categorized into four different types as given below: 
e Key-value (Redis, Riak) 
e Graph database (Neo4J, HyperGraph DB) 
e Document-oriented (MongoDB, CouchDB) 
e Big Table/Column families (Cassandra, HBase) 


Out of these four types, document database is the most popular and widely used database in 
today’s world. 


Instead of using tables and rows as in the traditional relational databases, MongoDB makes use 
of collections and documents. Documents consist of key-value pairs which are the basic unit of 
data in MongoDB. Collections contain sets of documents and functions which is the equivalent 
of relational database tables. 


MongoDB Examples 


Example 1 
Below is a simple MongoDB document structure: 


{ 
author: 'J.K Rowling', 


by: '1965', 
url: 'https://www.jkrowling.com', 
books: ['Harry Potter 1', 'Harry Potter 2'] 


One of the many advantages of MongoDB is that you can store your records without worrying 
about the data structure such as the number of fields or types of fields to store values. 


Example 2 
The below example shows how a document can be modeled in MongoDB. 


{ 

_id : <ObjectId>, 

CustomerName: Henry, 

Order: 

{ 

OrderID: 111, 
Product: Jeans Khaki, 
Quantity: 2 


How does MongoDB store Data? 


e MongoDB stores data in documents in spite of tables. You can change the structure of 
records (which is called documents in MongoDB) simply by adding new fields or deleting 
existing ones. 

e This ability of MongoDB helps you to represent hierarchical relationships, store arrays, 
and other more complex structures easily. 

e The database management system also provides high performance, high availability, 
easy scalability, and out-of-the-box replication and auto-sharding. 


Key Features 


e Dynamic Schemas: MongoDB’s dynamic schema provides a simple way to incorporate 
changes as application requirements change. These changes can be made to the 


database without affecting existing data or application code and without incurring 
downtime. 

e Operational Intelligence: MongoDB’s native map/reduce and aggregation framework 
provides insights in real-time for applications, going beyond the capabilities of batch 
analytics technologies like Hadoop and traditional BI tools. 

e Deployment Flexibility: MongoDB was built to work with commodity hardware and 
cloud architectures. Data is localized for queries to ensure performance is robust and 
predictable regardless of deployment size. 

e Simple Scale-Out: MongoDB is designed to be scaled across server clusters. As data 
volumes grow, organizations can simply add more nodes to their clusters, and MongoDB 
will balance the data seamlessly and automatically in the background. 

e Rich Querying: MongoDB supports a full query language and primary and secondary 
indexing, as well as a full-text search with Google-like syntax. 


Why and Where you should use Mongo DB? 


Below are a few of the reasons why one should start using MongoDB: 


e Big data: MongoDB is always preferred by the developers or project managers when 
our main concern is the deal with a large volume of data with high performance. If we 
want to insert thousands of records in a second, then MongoDB is the best choice for 
that. 

e Document-oriented: Since MongoDB is a NoSQL type database, instead of having data 
in a relational type format, it stores the data in documents. This makes MongoDB very 
flexible and adaptable to real business world situations and requirements. 

e Ad hoc queries: MongoDB supports search by field, range queries, and regular 
expression searches. Queries can be made to return specific fields within documents. 

e Indexing: Indexes can be created to improve the performance of searches within 
MongoDB. Any field in a MongoDB document can be indexed. 

e Replication: MongoDB can provide high availability with replica sets. A replica set 
consists of two or more mongo DB instances. Each replica set member may act in the 
role of the primary or secondary replica at any time. The primary replica is the main 
server that interacts with the client and performs all the read/write operations. The 
Secondary replicas maintain a copy of the data of the primary using built-in replication. 
When a primary replica fails, the replica set automatically switches over to the 
secondary, and then it becomes the primary server. 

e Load balancing: MongoDB uses the concept of sharding to scale horizontally by 
splitting data across multiple MongoDB instances. MongoDB can run over multiple 
servers, balancing the load and/or duplicating data to keep the system up and running in 
case of hardware failure. There is no need for any type of administrative work for 
performing any type of horizontal scaling in MongoDB. 


o Also, horizontal scaling (adding new columns) is not so easy process in any 
RDBMS system. But in the case of MongoDB, it is very much easy since it is a 
schema-less database and this type of work can be directly handled by the 
application automatically. 


Use Cases 


e Content Management and Delivery 
o Manage a diverse product catalog of content in a single data store that allows for 
quick changes and rapid response times without additional complexity from 
content retrieval systems. 
e Mobile and Social Infrastructure 
o MongoDB provides a high-availability, low-latency, agile, and scalable platform 
that allows for geospatial capabilities, real-time analytics, and global availability. 
e Customer Data Management 
o Utilize rich querying capabilities for real-time analytics on massive user bases 
with complex data models by using dynamic schemas and auto-sharding for 
horizontal scaling. 


As such, MongoDB is good for the below types of situations: 

E-Commerce type of product-based applications 

Blog and Content Management systems 

High-Speed logging, caching, etc in Real-time 

Need to maintain location-wise Geospatial data 

If the application is a loosely coupled mechanism — means design may change at any 
point in time. 


Key Components of MongoDB Architecture 


Below are a few of the common terms used in MongoDB: 


e id 
o This is a field required in every MongoDB document. The _id field represents a 
unique value in the MongoDB document. The _ id field is like the document’s 
primary key. If you create a new document without an _id field, MongoDB will 
automatically create the field. So for example, if we see the example of the above 
customer table, Mongo DB will add a 24 digit unique identifier to each document 
in the collection. 
e Collection 
o This is a grouping of MongoDB documents. A collection is the equivalent of a 
table that is created in any other RDMS such as Oracle or MS SQL. A collection 
exists within a single database. As seen from the introduction collections don’t 
enforce any sort of structure. 


Cursor 
o This is a pointer to the result set of a query. Clients can iterate through a cursor 
to retrieve results. 
Database 
o This is a container for collections like in RDMS wherein it is a container for tables. 
Each database gets its own set of files on the file system. A MongoDB server can 
store multiple databases. 
Document 
o Arecord ina MongoDB collection is basically called a document. The document, 
in turn, will consist of field names and values. 
Field 
o Aname-value pair in a document. A document has zero or more fields. Fields are 
analogous to columns in relational databases. The following diagram shows an 
example of Fields with Key-value pairs. So in the example below CustomerID 
and 11 is one of the key-value pair’s defined in the document. 


Data Modelling in MongoDB 


As we have seen from the Introduction section, the data in MongoDB has a flexible schema. 
Unlike in SQL databases, where you must have a table’s schema declared before inserting 
data, MongoDB’s collections do not enforce document structure. This sort of flexibility is what 
makes MongoDB so powerful. 


When modeling data in MongoDB, keep the following things in mind: 


What are the needs of the application — Look at the business needs of the application 
and see what data and the type of data are needed for the application. Based on this, 
ensure that the structure of the document is decided accordingly. 

What are data retrieval patterns — If you foresee heavy query usage then consider the 
use of indexes in your data model to improve the efficiency of queries. 

Are frequent inserts, updates, and removals happening in the database? Reconsider the 
use of indexes or incorporate sharding if required in your data modeling design to 
improve the efficiency of your overall MongoDB environment. 


Difference between MongoDB & RDBMS 


Below are some of the key term differences between MongoDB and RDBMS: 


Table Collection In RDBMS, the table contains the columns and rows which are 
used to store the data whereas, in MongoDB, this same structure 
is known as a collection. The collection contains documents that 
in turn contain Fields, which in turn are key-value pairs. 


Column | Field In RDBMS, the column denotes a set of data values. These in 
MongoDB are known as Fields. 


Embedded In RDBMS, data is sometimes spread across various tables, and 

documents to show a complete view of all data, a join is sometimes formed 
across tables to get the data. In MongoDB, the data is normally 
stored in a single collection, but separated by using Embedded 
documents. So there is no concept of joins in MongoDB. 





Platform And Language Support 


MongoDB also provides official support for a large number of programming languages and 
frameworks. Mongo Drivers are available for the following popular languages and Frameworks: 
C 

C++ 

C# and .NET 

Java 

Node.js 

Perl 

PHP, PHP Libraries, Frameworks, and Tools. 

Python 

Ruby 


MongoDB Limitations 


e In MongoDB, there is no provision for Stored Procedures or functions or trigger so there 
are no chances to implement any business logic at the database level which can be 
done in any RDBMS system. 


e It can’t handle complex transactions. 
e MongoDB is not as strong ACID (Atomic, Consistency, Isolation & Durability) as 
compared to most RDBMS systems. 
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